Automated Benchmarking of KR-systems

نویسنده

  • Christoph Redl
چکیده

Benchmarking is an important part of scientific work on solving techniques for KR systems. The implementation of hand-crafted scripts for each benchmark problem is cumbersome and repetitive. While most benchmarks are similar such that the process appears to be largely automatable, there are also differences which inhibit a complete reuse of existing scripts, e.g., different parameters to be measured and different aggregation functions to be applied. This calls for a tool which is applicable out of the box for a large range of benchmarks, but still allows for easy customization if needed. In this paper, we present such a system for automated benchmarking, which we base on a formalization of customizable benchmarks. The system captures the whole benchmarking process, including the run of individual instances, extraction of relevant information from the command outputs, aggregation of the results, and generation of the final benchmark table. A single command can then be used to generate the final table in LTEX format which can conveniently be copied and pasted into a paper. In contrast to existing approaches, which usually focus on standardized settings such as in large-scale competitions, ours focuses on the possibility for customization, i.e., on benchmarks with possibly heterogeneous parameters and values to be measured, such as those which arise when evaluating new evaluation techniques.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Evaluation of Ca-Independent a-Amylase Production by Bacillus sp. KR-8104 in Submerged and Solid State Fermentation Systems

This study investigates the production of crude Ca-independent and low pH active α-amylase by Bacillussp. KR-8104 in submerged fermentation (SmF) and solid-state fermentation (SSF) systems. Differentparameters were evaluated in each system using “one factor at a time” approach to improve the production ofenzyme. The results showed that in the SmF the maximum enzyme production ...

متن کامل

Cost Function Modelling for Semi-automated SC, RTG and Automated and Semi-automated RMG Container Yard Operating Systems

This study analyses the concept of cost functions for semi-automated Straddle Carrier (SC), Rubber Tyred Gantry (RTG) and automated Rail Mounted Gantry (RMG) container yard operating cranes. It develops a generic cost based model for a pair-wise comparison, analysis and evaluation of economic efficiency and effectiveness of container yard equipment to be used for decision-making by terminal pla...

متن کامل

Patent Application Publication ( 10 ) Pub . No . : US 2012 / 0173187 A 1 United States

(54) METHOD AND APPARATUS FOR Publication Classi?cation EVALUATING PERFORMANCE OF MOBILE (51) Int Cl TERMINAL G06F 19/00 (2011.01) _ G06F 11/34 (2006.01) (75) Inventors: W00 Kwang Lee’ suwonisl (KR); (52) us. Cl. ...................................................... .. 702/123 Dong Kun Shin, SuWon-s1 (KR) (57) ABSTRACT (73) Assignee? SAMSUNG ELECTRONICS COA method and an apparatus for evaluati...

متن کامل

A Container-centric Methodology for Benchmarking Workflow Management Systems

Trusted benchmarks should provide reproducible results obtained following a transparent and well-defined process. In this paper, we show how Containers, originally developed to ease the automated deployment of Cloud application components, can be used in the context of a benchmarking methodology. The proposed methodology focuses on Workflow Management Systems (WfMSs), a critical service orchest...

متن کامل

The ILTP Library: Benchmarking Automated Theorem Provers for Intuitionistic Logic

The Intuitionistic Logic Theorem Proving (ILTP) Library provides a platfom for testing and benchmarking theorem provers for first-order intuitionistic logic. It includes a collection of benchmark problems in a standardised syntax and performance results obtained by a comprehensive test of currently available intuitionistic theorem proving systems. These results are used to provide information a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016